Surface Electromyography Based Speech Recognition System and Development Toolkit by Daniel Chang Thesis

نویسنده

  • Timothy W. Bretl
چکیده

This thesis describes the implementation of an automatic speech recognition system based on surface electromyography signals. Data collection was done using a bipolar electrode configuration with a sampling rate of 5.77 kHz. Four feature sets, the shorttime Fourier transform (STFT), the dual-tree complex wavelet transform (DTCWT), a non-causal time-domain based (E4-NC), and a causal version of E4-NC (E4-C) were implemented. Classification was performed using a hidden Markov model (HMM). The system implemented was able to achieve an accuracy rate of 74.24% with E4-NC and 61.25% with E4-C. These results are comparable to previously reported results for offline, single session, isolated word recognition. Additional testing was performed on five subjects using E4-C and yielded accuracy rates ranging from 51.8% to 81.88% with an average accuracy rate of 64.9% during offline, single session, isolated word recognition. The E4-C was chosen since it offered the best performance among the causal feature sets and non-causal feature sets cannot be used with real-time online classification. Online classification capabilities were implemented and simulations using the confidence interval (CI) and minimum noise likelihood (MNL) decision rubrics yielded accuracy rates of 77.5% and 72.5%, respectively, during online, single session, isolated word recognition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

Integration of a Communication System for Social Behavior Analysis in the SocialRobot Project

Human-Robot-Interaction is a vast research field with great potential and importance for the assignment of communication capabilities to robots. Dialogs can be considered as one of the most common ways of communication between humans. Through the auditory system, humans can locate the person with whom they are dialoguing, listen to what is said and perceive the emotional state of that person. I...

متن کامل

پایه‌گذاری بستری نو و کارآمد در حوزه بازشناسی گفتار فارسی

Although researches in the field of Persian speech recognition  claim  a  thirty-year-old  history in Iran  which has achieved considerable progresses, due to the lack of well-defined experimental framework, outcomes from many of these researches are not comparable to each other and their accurate assessment won’t be possible. The experimental framework includes ASR toolkit and speech database ...

متن کامل

Multimodal Silent Speech Interface based on Video, Depth, Surface Electromyography and Ultrasonic Doppler: Data Collection and First Recognition Results

Silent Speech Interfaces use data from the speech production process, such as visual information of face movements. However, using a single modality limits the amount of available information. In this study we start to explore the use of multiple data input modalities in order to acquire a more complete representation of the speech production model. We have selected 4 non-invasive modalities – ...

متن کامل

A Spectral Mapping Method for EMG-based Recognition of Silent Speech

This paper reports on our latest study on speech recognition based on surface electromyography (EMG). This technology allows for Silent Speech Interfaces since EMG captures the electrical potentials of the human articulatory muscles rather than the acoustic speech signal. Therefore, our technology enables speech recognition to be applied to silently mouthed speech. Earlier experiments indicate ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009